Nong Sang

DenseGRPO: From Sparse to Dense Reward for Flow Matching Model Alignment

Jan 28, 2026
Via arXiv

Is Nano Banana Pro a Low-Level Vision All-Rounder? A Comprehensive Evaluation on 14 Tasks and 40 Datasets

Dec 19, 2025
Via arXiv

Learning to Tell Apart: Weakly Supervised Video Anomaly Detection via Disentangled Semantic Alignment

Nov 13, 2025
Via arXiv

VideoLucy: Deep Memory Backtracking for Long Video Understanding

Oct 14, 2025
Via arXiv

Learning Unpaired Image Dehazing with Physics-based Rehazy Generation

Jun 15, 2025
Via arXiv

ReID5o: Achieving Omni Multi-modal Person Re-identification in a Single Model

Jun 11, 2025
Via arXiv

MP-Mat: A 3D-and-Instance-Aware Human Matting and Editing Framework with Multiplane Representation

Apr 20, 2025
Via arXiv

Taming Consistency Distillation for Accelerated Human Image Animation

Apr 15, 2025
Via arXiv

DMPT: Decoupled Modality-aware Prompt Tuning for Multi-modal Object Re-identification

Apr 15, 2025
Via arXiv

UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer

Apr 15, 2025
Via arXiv